Reoptimization of the Closest Substring Problem under Pattern Length Modification
نویسندگان
چکیده
is study investigates whether reoptimization can help in solving the closest substring problem. We are dealing with the following reoptimization scenario. Suppose, we have an optimal llength closest substring of a given set of sequences S. How can this information be benecial in obtaining an (l + k)-length closest substring for S? In this study, we show that the problem is still computationally hard even with k = 1. We present greedy approximation algorithms that make use of the given information and prove that it has an additive error that grows as the parameter k increases. Furthermore, we present hard instances for each algorithm to show that the computed approximation ratio is tight. We also show that we can slightly improve the running-time of the existing polynomial-time approximation scheme (PTAS) for the original problem through reoptimization. CCS CONCEPTS •Mathematics of computing→Combinatorial optimization; Approximation algorithms;
منابع مشابه
On Self-Reducibility and Reoptimization of Closest Substring Problem
In this paper, we define the reoptimization variant of the closest substring problem (CSP) under sequence addition. We show that, even with the additional information we have about the problem instance, the problem of finding a closest substring is still NP-hard. We investigate the combinatorial property of optimization problems called self-reducibility. We show that problems that are polynomia...
متن کاملOn the k-Closest Substring and k-Consensus Pattern Problems
Given a set S = {s1, s2, . . . , sn} of strings each of length m, and an integer L, we study the following two problems. k-Closest Substring problem: find k center strings c1, c2, . . . , ck of length L minimizing d such that for each sj ∈ S, there is a length-L substring tj (closest substring) of sj with min1≤i≤k d(ci, tj) ≤ d. We give a PTAS for this problem, for k = O(1). k-Consensus Pattern...
متن کاملClosest Substring Problems with Small Distances
We study two pattern matching problems that are motivated by applications in computational biology. In the Closest Substring problem k strings s1, . . ., sk are given, and the task is to find a string s of length L such that each string si has a consecutive substring of length L whose distance is at most d from s. We present two algorithms that aim to be efficient for small fixed values of d an...
متن کامل0 00 20 12 v 1 1 7 Fe b 20 00 On The Closest String and Substring Problems ∗
The problem of finding a center string that is ‘close’ to every given string arises and has many applications in computational molecular biology and coding theory. This problem has two versions: the Closest String problem and the Closest Substring problem. Assume that we are given a set of strings S = {s1, s2, . . . , sn} of strings, say, each of length m. The Closest String problem [1, 2, 4, 5...
متن کاملA Closer Look at the Closest String and Closest Substring Problem
Let S be a set of k strings over an alphabet Σ; each string has a length between ` and n. The Closest Substring Problem (CSSP) is to find a minimal integer d (and a corresponding string t of length `) such that each string s ∈ S has a substring of length ` with Hamming distance at most d to t. We say t is the closest substring to S. For ` = n, this problem is known as the Closest String Problem...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1703.06644 شماره
صفحات -
تاریخ انتشار 2017